A Chaotic Antlion Optimization Algorithm for Text Feature Selection
نویسندگان
چکیده
Abstract Text classification is one of the important technologies in field text data mining. Feature selection, as a key step processing tasks, used to process high-dimensional feature sets, which directly affects final performance. At present, most widely selection methods academia are calculate importance each for through an evaluation function, and then select subsets that meet quantitative requirements turn. However, ignoring correlation between features effect their mutual combination this way may not guarantee best effect. Therefore, paper proposes chaotic antlion algorithm (CAFSA) solve problem. The main contributions include: (1) Propose (CAA) based on quasi-opposition learning mechanism chaos strategy, compare it with other four algorithms 11 benchmark functions. has achieved higher convergence speed highest optimization accuracy. (2) Study performance CAFSA using CAA when different models, including decision tree, Naive Bayes, SVM classifier. (3) compared eight three Chinese datasets. experimental results show can reduce number improve accuracy classifier, better than methods.
منابع مشابه
An Improved Flower Pollination Algorithm with AdaBoost Algorithm for Feature Selection in Text Documents Classification
In recent years, production of text documents has seen an exponential growth, which is the reason why their proper classification seems necessary for better access. One of the main problems of classifying text documents is working in high-dimensional feature space. Feature Selection (FS) is one of the ways to reduce the number of text attributes. So, working with a great bulk of the feature spa...
متن کاملAn Improved Flower Pollination Algorithm with AdaBoost Algorithm for Feature Selection in Text Documents Classification
In recent years, production of text documents has seen an exponential growth, which is the reason why their proper classification seems necessary for better access. One of the main problems of classifying text documents is working in high-dimensional feature space. Feature Selection (FS) is one of the ways to reduce the number of text attributes. So, working with a great bulk of the feature spa...
متن کاملAn Improved K-Nearest Neighbor with Crow Search Algorithm for Feature Selection in Text Documents Classification
The Internet provides easy access to a kind of library resources. However, classification of documents from a large amount of data is still an issue and demands time and energy to find certain documents. Classification of similar documents in specific classes of data can reduce the time for searching the required data, particularly text documents. This is further facilitated by using Artificial...
متن کاملAn Improved K-Nearest Neighbor with Crow Search Algorithm for Feature Selection in Text Documents Classification
The Internet provides easy access to a kind of library resources. However, classification of documents from a large amount of data is still an issue and demands time and energy to find certain documents. Classification of similar documents in specific classes of data can reduce the time for searching the required data, particularly text documents. This is further facilitated by using Artificial...
متن کاملText Feature Selection using Particle Swarm Optimization Algorithm
Text Categorization (TC) has become recently an important technology in the field of organizing a huge number of documents. Feature Selection (FS) is commonly used to reduce dimensionality of text datasets with huge number of features which would be difficult to process further. In this paper we have implemented an efficient feature selection algorithm based on Particle Swarm Optimization (PSO)...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Computational Intelligence Systems
سال: 2022
ISSN: ['1875-6883', '1875-6891']
DOI: https://doi.org/10.1007/s44196-022-00094-5